64 research outputs found

    Toxicity

    Get PDF
    In research on online comments on social media platforms, different terms are widely used to describe comments that are hateful or disrespectful and thereby poison a discussion. This chapter takes a theoretical perspective on the term toxicity and related research in the field of computer science. More specifically, it explains the usage of the term and why its exact interpretation depends on the platform in question. Further, the article discusses the advantages of toxicity over other terms and provides an overview of the available toxic comment datasets. Finally, it introduces the concept of engaging comments as the counterpart of toxic comments, leading to a task that is complementary to the prevention and removal of toxic comments: the fostering and highlighting of engaging comments

    Top Comment or Flop Comment? Predicting and Explaining User Engagement in Online News Discussions

    Full text link
    Comment sections below online news articles enjoy growing popularity among readers. However, the overwhelming number of comments makes it infeasible for the average news consumer to read all of them and hinders engaging discussions. Most platforms display comments in chronological order, which neglects that some of them are more relevant to users and are better conversation starters. In this paper, we systematically analyze user engagement in the form of the upvotes and replies that a comment receives. Based on comment texts, we train a model to distinguish comments that have either a high or low chance of receiving many upvotes and replies. Our evaluation on user comments from TheGuardian.com compares recurrent and convolutional neural network models, and a traditional feature-based classifier. Further, we investigate what makes some comments more engaging than others. To this end, we identify engagement triggers and arrange them in a taxonomy. Explanation methods for neural networks reveal which input words have the strongest influence on our model's predictions. In addition, we evaluate on a dataset of product reviews, which exhibit similar properties as user comments, such as featuring upvotes for helpfulness.Comment: Accepted at the International Conference on Web and Social Media (ICWSM 2020); 11 pages; code and data are available at https://hpi.de/naumann/projects/repeatability/text-mining.htm

    Magnetresonanztomographische Normwerte der links- und rechtsventrikulären Wanddicke, Muskelmasse, des Volumen und der Funktion von Kindern, Jugendlichen und jungen Erwachsenen

    Get PDF
    Im Rahmen der Dissertationsschrift wurden erstmals MRT-Normwerte für die segmentale biventrikuläre myokardiale Wanddicke und Muskelmasse für Kinder und Jugendliche erstellt. Ferner wurden MRT-Referenzwerte für die Ventrikelvolumina und -funktion erhoben. Anstatt klassischer Perzentilenkurven zur Einschätzung der somatischen Entwicklung von Kindern und Jugendlichen, die typischerweise Körpergröße, -gewicht und Kopfumfang ins Verhältnis zum Alter setzen, konnten wir die BSA als bessere Bezugsgröße für die biventrikuläre Wanddicke und Muskelmasse ausmachen. Unsere Ergebnisse können einen entscheidenden Beitrag in der Diagnostik von angeborenen und erworbenen Herzfehlern sowie pathologischen myokardialen Veränderungen leisten und als Referenz im klinischen Alltag und für zukünftige Studien diene

    A Unified System for Aggression Identification in English Code-Mixed and Uni-Lingual Texts

    Full text link
    Wide usage of social media platforms has increased the risk of aggression, which results in mental stress and affects the lives of people negatively like psychological agony, fighting behavior, and disrespect to others. Majority of such conversations contains code-mixed languages[28]. Additionally, the way used to express thought or communication style also changes from one social media plat-form to another platform (e.g., communication styles are different in twitter and Facebook). These all have increased the complexity of the problem. To solve these problems, we have introduced a unified and robust multi-modal deep learning architecture which works for English code-mixed dataset and uni-lingual English dataset both.The devised system, uses psycho-linguistic features and very ba-sic linguistic features. Our multi-modal deep learning architecture contains, Deep Pyramid CNN, Pooled BiLSTM, and Disconnected RNN(with Glove and FastText embedding, both). Finally, the system takes the decision based on model averaging. We evaluated our system on English Code-Mixed TRAC 2018 dataset and uni-lingual English dataset obtained from Kaggle. Experimental results show that our proposed system outperforms all the previous approaches on English code-mixed dataset and uni-lingual English dataset.Comment: 10 pages, 5 Figures, 6 Tables, accepted at CoDS-COMAD 202

    The IgA nephropathy Biobank. An important starting point for the genetic dissection of a complex trait

    Get PDF
    BACKGROUND: IgA nephropathy (IgAN) or Berger's disease, is the most common glomerulonephritis in the world diagnosed in renal biopsied patients. The involvement of genetic factors in the pathogenesis of the IgAN is evidenced by ethnic and geographic variations in prevalence, familial clustering in isolated populations, familial aggregation and by the identification of a genetic linkage to locus IGAN1 mapped on 6q22–23. This study seems to imply a single major locus, but the hypothesis of multiple interacting loci or genetic heterogeneity cannot be ruled out. The organization of a multi-centre Biobank for the collection of biological samples and clinical data from IgAN patients and relatives is an important starting point for the identification of the disease susceptibility genes. DESCRIPTION: The IgAN Consortium organized a Biobank, recruiting IgAN patients and relatives following a common protocol. A website was constructed to allow scientific information to be shared between partners and to divulge obtained data (URL: ). The electronic database, the core of the website includes data concerning the subjects enrolled. A search page gives open access to the database and allows groups of patients to be selected according to their clinical characteristics. DNA samples of IgAN patients and relatives belonging to 72 multiplex extended pedigrees were collected. Moreover, 159 trios (sons/daughters affected and healthy parents), 1068 patients with biopsy-proven IgAN and 1040 healthy subjects were included in the IgAN Consortium Biobank. Some valuable and statistically productive genetic studies have been launched within the 5(th )Framework Programme 1998–2002 of the European project No. QLG1-2000-00464 and preliminary data have been published in "Technology Marketplace" website: . CONCLUSION: The first world IgAN Biobank with a readily accessible database has been constituted. The knowledge gained from the study of Mendelian diseases has shown that the genetic dissection of a complex trait is more powerful when combined linkage-based, association-based, and sequence-based approaches are performed. This Biobank continuously expanded contains a sample size of adequately matched IgAN patients and healthy subjects, extended multiplex pedigrees, parent-child trios, thus permitting the combined genetic approaches with collaborative studies

    Five Glutathione S-Transferase Gene Variants in 23,452 Cases of Lung Cancer and 30,397 Controls: Meta-Analysis of 130 Studies

    Get PDF
    BACKGROUND: Glutathione S-transferases (GSTs) are known to abolish or reduce the activities of intracellular enzymes that help detoxify environmental carcinogens, such as those found in tobacco smoke. It has been suggested that polymorphisms in the GST genes are risk factors for lung cancer, but a large number of studies have reported apparently conflicting results. METHODS AND FINDINGS: Literature-based meta-analysis was supplemented by tabular data from investigators of all relevant studies of five GST polymorphisms ( GSTM1 null, GSTT1 null, I105V, and A114V polymorphisms in the GSTP1 genes, and GSTM3 intron 6 polymorphism) available before August, 2005, with investigation of potential sources of heterogeneity. Included in the present meta-analysis were 130 studies, involving a total of 23,452 lung cancer cases and 30,397 controls. In a combined analysis, the relative risks for lung cancer of the GSTM1 null and GSTT1 null polymorphisms were 1.18 (95% confidence interval [CI]: 1.14–1.23) and 1.09 (95% CI: 1.02–1.16), respectively, but in the larger studies they were only 1.04 (95% CI: 0.95–1.14) and 0.99 (95% CI: 0.86–1.11), respectively. In addition to size of study, ethnic background was a significant source of heterogeneity among studies of the GSTM1 null genotype, with possibly weaker associations in studies of individuals of European continental ancestry. Combined analyses of studies of the 105V, 114V, and GSTM3*B variants showed no significant overall associations with lung cancer, yielding per-allele relative risks of 1.04 (95% CI: 0.99–1.09), 1.15 (95% CI: 0.95–1.39), and 1.05 (95% CI: 0.89–1.23), respectively. CONCLUSIONS: The risk of lung cancer is not strongly associated with the I105V and A114V polymorphisms in the GSTP1 gene or with GSTM3 intron 6 polymorphism. Given the non-significant associations in the larger studies, the relevance of the weakly positive overall associations with the GSTM1 null and the GSTT1 null polymorphisms is uncertain. As lung cancer has important environmental causes, understanding any genetic contribution to it in general populations will require the conduct of particularly large and comprehensive studies

    Mendelian randomisation study of height and body mass index as modifiers of ovarian cancer risk in 22,588 BRCA1 and BRCA2 mutation carriers

    Get PDF
    Funder: CIMBA: The CIMBA data management and data analysis were supported by Cancer Research – UK grants C12292/A20861, C12292/A11174. ACA is a Cancer Research -UK Senior Cancer Research Fellow. GCT and ABS are NHMRC Research Fellows. iCOGS: the European Community's Seventh Framework Programme under grant agreement No. 223175 (HEALTH-F2-2009-223175) (COGS), Cancer Research UK (C1287/A10118, C1287/A 10710, C12292/A11174, C1281/A12014, C5047/A8384, C5047/A15007, C5047/A10692, C8197/A16565), the National Institutes of Health (CA128978) and Post-Cancer GWAS initiative (1U19 CA148537, 1U19 CA148065 and 1U19 CA148112 - the GAME-ON initiative), the Department of Defence (W81XWH-10-1-0341), the Canadian Institutes of Health Research (CIHR) for the CIHR Team in Familial Risks of Breast Cancer (CRN-87521), and the Ministry of Economic Development, Innovation and Export Trade (PSR-SIIRI-701), Komen Foundation for the Cure, the Breast Cancer Research Foundation, and the Ovarian Cancer Research Fund. The PERSPECTIVE project was supported by the Government of Canada through Genome Canada and the Canadian Institutes of Health Research, the Ministry of Economy, Science and Innovation through Genome Québec, and The Quebec Breast Cancer Foundation. BCFR: UM1 CA164920 from the National Cancer Institute. The content of this manuscript does not necessarily reflect the views or policies of the National Cancer Institute or any of the collaborating centers in the Breast Cancer Family Registry (BCFR), nor does mention of trade names, commercial products, or organizations imply endorsement by the US Government or the BCFR. BFBOCC: Lithuania (BFBOCC-LT): Research Council of Lithuania grant SEN-18/2015. BIDMC: Breast Cancer Research Foundation. BMBSA: Cancer Association of South Africa (PI Elizabeth J. van Rensburg). CNIO: Spanish Ministry of Health PI16/00440 supported by FEDER funds, the Spanish Ministry of Economy and Competitiveness (MINECO) SAF2014-57680-R and the Spanish Research Network on Rare diseases (CIBERER). COH-CCGCRN: Research reported in this publication was supported by the National Cancer Institute of the National Institutes of Health under grant number R25CA112486, and RC4CA153828 (PI: J. Weitzel) from the National Cancer Institute and the Office of the Director, National Institutes of Health. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health. CONSIT: Associazione Italiana Ricerca sul Cancro (AIRC; IG2014 no.15547) to P. Radice. Italian Association for Cancer Research (AIRC; grant no.16933) to L. Ottini. Associazione Italiana Ricerca sul Cancro (AIRC; IG2015 no.16732) to P. Peterlongo. Jacopo Azzollini is supported by funds from Italian citizens who allocated the 5x1000 share of their tax payment in support of the Fondazione IRCCS Istituto Nazionale Tumori, according to Italian laws (INT-Institutional strategic projects ‘5x1000’). DEMOKRITOS: European Union (European Social Fund – ESF) and Greek national funds through the Operational Program "Education and Lifelong Learning" of the National Strategic Reference Framework (NSRF) - Research Funding Program of the General Secretariat for Research & Technology: SYN11_10_19 NBCA. Investing in knowledge society through the European Social Fund. DFKZ: German Cancer Research Center. EMBRACE: Cancer Research UK Grants C1287/A10118 and C1287/A11990. D. Gareth Evans and Fiona Lalloo are supported by an NIHR grant to the Biomedical Research Centre, Manchester. The Investigators at The Institute of Cancer Research and The Royal Marsden NHS Foundation Trust are supported by an NIHR grant to the Biomedical Research Centre at The Institute of Cancer Research and The Royal Marsden NHS Foundation Trust. Ros Eeles and Elizabeth Bancroft are supported by Cancer Research UK Grant C5047/A8385. Ros Eeles is also supported by NIHR support to the Biomedical Research Centre at The Institute of Cancer Research and The Royal Marsden NHS Foundation Trust. FCCC: The University of Kansas Cancer Center (P30 CA168524) and the Kansas Bioscience Authority Eminent Scholar Program. A.K.G. was funded by R0 1CA140323, R01 CA214545, and by the Chancellors Distinguished Chair in Biomedical Sciences Professorship. FPGMX: FISPI05/2275 and Mutua Madrileña Foundation (FMMA). GC-HBOC: German Cancer Aid (grant no 110837, Rita K. Schmutzler) and the European Regional Development Fund and Free State of Saxony, Germany (LIFE - Leipzig Research Centre for Civilization Diseases, project numbers 713-241202, 713-241202, 14505/2470, 14575/2470). GEMO: Ligue Nationale Contre le Cancer; the Association “Le cancer du sein, parlons-en!” Award, the Canadian Institutes of Health Research for the "CIHR Team in Familial Risks of Breast Cancer" program and the French National Institute of Cancer (INCa grants 2013-1-BCB-01-ICH-1 and SHS-E-SP 18-015). GEORGETOWN: the Non-Therapeutic Subject Registry Shared Resource at Georgetown University (NIH/NCI grant P30-CA051008), the Fisher Center for Hereditary Cancer and Clinical Genomics Research, and Swing Fore the Cure. G-FAST: Bruce Poppe is a senior clinical investigator of FWO. Mattias Van Heetvelde obtained funding from IWT. HCSC: Spanish Ministry of Health PI15/00059, PI16/01292, and CB-161200301 CIBERONC from ISCIII (Spain), partially supported by European Regional Development FEDER funds. HEBCS: Helsinki University Hospital Research Fund, Academy of Finland (266528), the Finnish Cancer Society and the Sigrid Juselius Foundation. HEBON: the Dutch Cancer Society grants NKI1998-1854, NKI2004-3088, NKI2007-3756, the Netherlands Organisation of Scientific Research grant NWO 91109024, the Pink Ribbon grants 110005 and 2014-187.WO76, the BBMRI grant NWO 184.021.007/CP46 and the Transcan grant JTC 2012 Cancer 12-054. HRBCP: Hong Kong Sanatorium and Hospital, Dr Ellen Li Charitable Foundation, The Kerry Group Kuok Foundation, National Institute of Health1R 03CA130065, and North California Cancer Center. HUNBOCS: Hungarian Research Grants KTIA-OTKA CK-80745 and OTKA K-112228. ICO: The authors would like to particularly acknowledge the support of the Asociación Española Contra el Cáncer (AECC), the Instituto de Salud Carlos III (organismo adscrito al Ministerio de Economía y Competitividad) and “Fondo Europeo de Desarrollo Regional (FEDER), una manera de hacer Europa” (PI10/01422, PI13/00285, PIE13/00022, PI15/00854, PI16/00563 and CIBERONC) and the Institut Català de la Salut and Autonomous Government of Catalonia (2009SGR290, 2014SGR338 and PERIS Project MedPerCan). IHCC: PBZ_KBN_122/P05/2004. ILUH: Icelandic Association “Walking for Breast Cancer Research” and by the Landspitali University Hospital Research Fund. INHERIT: Canadian Institutes of Health Research for the “CIHR Team in Familial Risks of Breast Cancer” program – grant # CRN-87521 and the Ministry of Economic Development, Innovation and Export Trade – grant # PSR-SIIRI-701. IOVHBOCS: Ministero della Salute and “5x1000” Istituto Oncologico Veneto grant. IPOBCS: Liga Portuguesa Contra o Cancro. kConFab: The National Breast Cancer Foundation, and previously by the National Health and Medical Research Council (NHMRC), the Queensland Cancer Fund, the Cancer Councils of New South Wales, Victoria, Tasmania and South Australia, and the Cancer Foundation of Western Australia. MAYO: NIH grants CA116167, CA192393 and CA176785, an NCI Specialized Program of Research Excellence (SPORE) in Breast Cancer (CA116201),and a grant from the Breast Cancer Research Foundation. MCGILL: Jewish General Hospital Weekend to End Breast Cancer, Quebec Ministry of Economic Development, Innovation and Export Trade. Marc Tischkowitz is supported by the funded by the European Union Seventh Framework Program (2007Y2013)/European Research Council (Grant No. 310018). MODSQUAD: MH CZ - DRO (MMCI, 00209805), MEYS - NPS I - LO1413 to LF and by the European Regional Development Fund and the State Budget of the Czech Republic (RECAMO, CZ.1.05/2.1.00/03.0101) to LF, and by Charles University in Prague project UNCE204024 (MZ). MSKCC: the Breast Cancer Research Foundation, the Robert and Kate Niehaus Clinical Cancer Genetics Initiative, the Andrew Sabin Research Fund and a Cancer Center Support Grant/Core Grant (P30 CA008748). NAROD: 1R01 CA149429-01. NCI: the Intramural Research Program of the US National Cancer Institute, NIH, and by support services contracts NO2-CP-11019-50, N02-CP-21013-63 and N02-CP-65504 with Westat, Inc, Rockville, MD. NICCC: Clalit Health Services in Israel, the Israel Cancer Association and the Breast Cancer Research Foundation (BCRF), NY. NNPIO: the Russian Foundation for Basic Research (grants 17-54-12007, 17-00-00171 and 18-515-12007). NRG Oncology: U10 CA180868, NRG SDMC grant U10 CA180822, NRG Administrative Office and the NRG Tissue Bank (CA 27469), the NRG Statistical and Data Center (CA 37517) and the Intramural Research Program, NCI. OSUCCG: Ohio State University Comprehensive Cancer Center. PBCS: Italian Association of Cancer Research (AIRC) [IG 2013 N.14477] and Tuscany Institute for Tumors (ITT) grant 2014-2015-2016. SEABASS: Ministry of Science, Technology and Innovation, Ministry of Higher Education (UM.C/HlR/MOHE/06) and Cancer Research Initiatives Foundation. SMC: the Israeli Cancer Association. SWE-BRCA: the Swedish Cancer Society. UCHICAGO: NCI Specialized Program of Research Excellence (SPORE) in Breast Cancer (CA125183), R01 CA142996, 1U01CA161032, P20CA233307, American Cancer Society (MRSG-13-063-01-TBG, CRP-10-119-01-CCE), Breast Cancer Research Foundation, Susan G. Komen Foundation (SAC110026), and Ralph and Marion Falk Medical Research Trust, the Entertainment Industry Fund National Women's Cancer Research Alliance. Mr. Qian was supported by the Alpha Omega Alpha Carolyn L. Cuckein Student Research Fellowship. UCLA: Jonsson Comprehensive Cancer Center Foundation; Breast Cancer Research Foundation. UCSF: UCSF Cancer Risk Program and Helen Diller Family Comprehensive Cancer Center. UKFOCR: Cancer Research UK. UPENN: Breast Cancer Research Foundation; Susan G. Komen Foundation for the cure, Basser Center for BRCA. UPITT/MWH: Hackers for Hope Pittsburgh. VFCTG: Victorian Cancer Agency, Cancer Australia, National Breast Cancer Foundation. WCP: Dr Karlan is funded by the American Cancer Society Early Detection Professorship (SIOP-06-258-01-COUN) and the National Center for Advancing Translational Sciences (NCATS), Grant UL1TR000124.Abstract: Background: Height and body mass index (BMI) are associated with higher ovarian cancer risk in the general population, but whether such associations exist among BRCA1/2 mutation carriers is unknown. Methods: We applied a Mendelian randomisation approach to examine height/BMI with ovarian cancer risk using the Consortium of Investigators for the Modifiers of BRCA1/2 (CIMBA) data set, comprising 14,676 BRCA1 and 7912 BRCA2 mutation carriers, with 2923 ovarian cancer cases. We created a height genetic score (height-GS) using 586 height-associated variants and a BMI genetic score (BMI-GS) using 93 BMI-associated variants. Associations were assessed using weighted Cox models. Results: Observed height was not associated with ovarian cancer risk (hazard ratio [HR]: 1.07 per 10-cm increase in height, 95% confidence interval [CI]: 0.94–1.23). Height-GS showed similar results (HR = 1.02, 95% CI: 0.85–1.23). Higher BMI was significantly associated with increased risk in premenopausal women with HR = 1.25 (95% CI: 1.06–1.48) and HR = 1.59 (95% CI: 1.08–2.33) per 5-kg/m2 increase in observed and genetically determined BMI, respectively. No association was found for postmenopausal women. Interaction between menopausal status and BMI was significant (Pinteraction < 0.05). Conclusion: Our observation of a positive association between BMI and ovarian cancer risk in premenopausal BRCA1/2 mutation carriers is consistent with findings in the general population

    Polygenic risk scores and breast and epithelial ovarian cancer risks for carriers of BRCA1 and BRCA2 pathogenic variants

    Get PDF
    Purpose We assessed the associations between population-based polygenic risk scores (PRS) for breast (BC) or epithelial ovarian cancer (EOC) with cancer risks forBRCA1andBRCA2pathogenic variant carriers. Methods Retrospective cohort data on 18,935BRCA1and 12,339BRCA2female pathogenic variant carriers of European ancestry were available. Three versions of a 313 single-nucleotide polymorphism (SNP) BC PRS were evaluated based on whether they predict overall, estrogen receptor (ER)-negative, or ER-positive BC, and two PRS for overall or high-grade serous EOC. Associations were validated in a prospective cohort. Results The ER-negative PRS showed the strongest association with BC risk forBRCA1carriers (hazard ratio [HR] per standard deviation = 1.29 [95% CI 1.25-1.33],P = 3x10(-72)). ForBRCA2, the strongest association was with overall BC PRS (HR = 1.31 [95% CI 1.27-1.36],P = 7x10(-50)). HR estimates decreased significantly with age and there was evidence for differences in associations by predicted variant effects on protein expression. The HR estimates were smaller than general population estimates. The high-grade serous PRS yielded the strongest associations with EOC risk forBRCA1(HR = 1.32 [95% CI 1.25-1.40],P = 3x10(-22)) andBRCA2(HR = 1.44 [95% CI 1.30-1.60],P = 4x10(-12)) carriers. The associations in the prospective cohort were similar. Conclusion Population-based PRS are strongly associated with BC and EOC risks forBRCA1/2carriers and predict substantial absolute risk differences for women at PRS distribution extremes.Peer reviewe

    Transcriptome-wide association study of breast cancer risk by estrogen-receptor status

    Get PDF
    Previous transcriptome-wide association studies (TWAS) have identified breast cancer risk genes by integrating data from expression quantitative loci and genome-wide association studies (GWAS), but analyses of breast cancer subtype-specific associations have been limited. In this study, we conducted a TWAS using gene expression data from GTEx and summary statistics from the hitherto largest GWAS meta-analysis conducted for breast cancer overall, and by estrogen receptor subtypes (ER+ and ER-). We further compared associations with ER+ and ER- subtypes, using a case-only TWAS approach. We also conducted multigene conditional analyses in regions with multiple TWAS associations. Two genes, STXBP4 and HIST2H2BA, were specifically associated with ER+ but not with ER- breast cancer. We further identified 30 TWAS-significant genes associated with overall breast cancer risk, including four that were not identified in previous studies. Conditional analyses identified single independent breast-cancer gene in three of six regions harboring multiple TWAS-significant genes. Our study provides new information on breast cancer genetics and biology, particularly about genomic differences between ER+ and ER- breast cancer.Peer reviewe
    corecore